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£SJ , Abstract. We propose a rigorous analysis approach for the subset sum problem in 

the context of lossless data compression, where the phase transition of the subset sum 
problem is directly related to the passage between ambiguous and non-ambiguous 
decompression, for a compression scheme that is based on specifying the sequence 
composition. The proposed analysis lends itself to straightforward extensions in 
several directions of interest, including non-binary alphabets, incorporation of side 
information at the decoder (Slepian-Wolf coding), and coding schemes based on 
multiple subset sums. It is also demonstrated that the proposed technique can be 
used to analyze the critical behavior in a more involved situation where the sequence 
composition is not specified by the encoder. 
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1. Introduction 

We consider a lossless data compression scheme that builds upon the the number 
partitioning problem and the closely related problem of subset sums: Given a set of 
integers, Oi, 02, . . . , ajv, a* G {1, 2, . . . , L}, i = 1,2, ...,N, the number partitioning 
problem is the problem of finding a subset 5 C {1,2,..., N}, such that the sums of 
{a{\ over S would be as balanced as possible with the sum over the remaining {a{\. 
More precisely, the goal is to find a subset S such that | Ya&s a i ~ J2ies c a i I would be 
minimum, or equivalently, to find a binary vector cr = (o"i, . . . , <tn) G { — 1, +1}^ such 
that I YliLi Q>i<7i\ would be minimum. Perfect partitioning means that this expression is 
exactly equal to zero. The problem of finding an optimum partition is NP-complete 
[6], [11] and it has a fairly long history (see, e.g., (TUJ Section 9.2], J9J Chapter 7] and 
many references therein). For the case where {c^} are drawn independently at random, 
some rigorous results have been obtained using methods of statistical mechanics, see, 
e.g, PQ, [2], [5], [E]. It has been shown (see also [9], [TO], [13]) that for a randomly 
selected vector (01, . . . , a^), and for L = 2 NR (R > 0, constant), there is a phase 
transition at R — 1. For R < 1, there are exponentially many solutions (er- vectors) to 
the number partitioning problem. More precisely, there are exponentially about 2 N ( 1 ~ R:> 
many solutions on the average. However, for R > 1, the probability that there exists 
even one solution decays exponentially. 

In the related problem of subset sums, the scope is extended to the evaluation of the 
total number tt(E) of binary vectors {cr} such that J2iLi cao~i = E, for any given value of 
E in the appropriate range, not only E = 0. Sasamoto [12] proposed a data compression 
scheme based on subset sums in its constrained version, that is, the one where binary 
vectors are sought only among those which have a given composition, namely, given 
numbers N + = Np (0 < p < 1) and N_ = Nq (q = 1 — p) of occurrences of <Tj = +1 and 
ai = —1, respectively, or equivalently, a given value of M(cr) = J2iLi&i — N(p — g)Jj| 
In particular, in view of the above described results concerning phase transitions, 
Sasamoto's insight was that for R above a certain threshold, the mapping between 
the set of binary vectors {cr} of a given composition to the sums E(a) = Y^iLi a i°~i must 
be essentially one-to-one for a typical realization of (ai, . . . , a^). This has lead him to 
propose a lossless data compression scheme that is based on encoding a binary string cr, 
with a composition of iV + = Np and iV_ = Nq, using a binary representation of E(cr) 
plus a relatively small overhead (of log(iV + 1) bits) for specifying the composition of 
cr, or equivalently, the value of M(<x) = Y,iLi°~i- Sasamoto argued that the threshold 
of reliable decoding occurs at R = h{p), where 

h(p) = —plogp — (1 — p) log(l — p) (1) 

is entropy of the binary information source that emits sequences with the aforementioned 
composition (within some small tolerance) with high probability, and so by taking 
R = h(p)+e (e > 0, arbitrarily small) and using the fact that the range of possible values 

I By contrast, in the unconstrained version, solutions are sought across all binary strings of length N. 
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of E(cr) does not exceed N-L, one may encode E(tr) using log(N-L) = N[h(p)+e]+\og N 
bits, and thereby essentially achieve the entropy of the information source. While this 
coding scheme is not very attractive from the practical point of view, the interesting 
point here is the relationship between the phase transition of the subset problem and 
the abrupt passage between ambiguous and non-ambiguous decoding as R crosses the 
entropy, in agreement with Shannon's fundamental coding theorems [3]. 

Sasamoto's approach was to analyze the number Q(E, M) of configurations {cr} 
with E(a) = E and M(er) = M, where E and M are the values pertaining to the source 
sequence & = (<3"i, . . . , &n) that was actually compressed. He argued that for a typical 
realization of {a*}, the behavior is as follows: For R < h(p), Q(E,M) is exponentially 
large and so the decoding of cr, based on E and M, is ambiguous, but for R > h(p), 
the expectation of Q(E, M) is exponentially small, and so the decoding is reliable with 
high probability. In order to assess the number of solutions to the two simultaneous 
equations E(o) = E and M{cr) = M, he applied the saddle point method (see also [TO] . 
|13j). In particular, he first defined a partition function of a Hamiltonian defined by a 
linear combination of E(cr) and M(cr), and then used the integral representation of the 
inverse transform of this partition function, that yields Q(E,M). This integral in turn 
was approximated using the saddle point method. 

The analysis in [12], which relies on the analysis in [13] . raises two technical 
concerns, however. The first is about the validity of the saddle point method in this 
situation: While the saddle point method is perfectly rigorous under the asymptotic 
regime where N — > oo while L is kept fixed, its validity becomes rather questionable^ 
in a regime where L grows with N, especially when the growth rate of L is as fast as 
exponential. The authors of [13] realize that the resulting approximation is definitely 
not valid when R > 1, which yields Q(E, M) < 1. The point, however, is that it is not 
quite clear whether this approximation is reliable even when R < 1. The fact that the 
resulting approximation below R = 1 does not lead to an obvious absurd is not enough 
to guarantee that the approximation is reliable. 

The second concern is that there is a difference between calculating the expectation 
of Q(E, M) when E and M are fixed and deterministic, and calculating the expectation 
of Q(E, M) when M = M(er) and E = E(&) = J2i aiO~i, because the latter is a random 
variable. The former quantity is what Sasamoto calculated and the latter is actually 
the relevant quantity for analyzing the data compression scheme. When computing 
the expectation of Q(E(a-), M(<r)), the randomness of E(&) is induced by the same 
set of random variables {a,j} that generate also the values of E(er) pertaining to all 
other binary vectors {cr}. In other words, in this calculation both the function Q(-, •) 
and its first argument E(&) fluctuate together, depending on {aj}. Indeed, for one 
thing, Q(E(&), M(<r)) (and hence also its expectation) must always be at least as large 
as unity (by construction), whereas the expectation of Q(E,M) for fixed E and M is 
shown in [12] to decay exponentially to zero for R above the threshold. This is clearly 



§ In [pj] there are detailed discussions about the validity of the saddle point method when L is a 
function of N (see the ending paragraph of Section 3 on page 9559 and pp. 9563-9564 therein). 
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a contradiction. 

In this work, we first propose a rigorous approach to evaluate the expectation 
of fi(E(&,M(&)), which is valid for every R > 0. Our starting point is (or can be 
interpreted as) essentially the same inverse transform integral of the above-mentioned 
partition function (but with a slight modification to account for the above discussed 
replacement of a fixed E by E(a)). However, unlike in [12] and [13], we avoid the use of 
the saddle point method in the evaluation of this integral and we propose a more refined 
analysis instead. The final result of this analysis is similar to that of [32] for R below 
the threshold, but it is not quite identical above the threshold: We show that when the 
relative frequencies of +1 and —1 in <x are p and q, respectively, and L = 2 NR , 

(n(E{&), M(o-))) = 1 + 2*^-^, (2) 

where (•) denotes expectation w.r.t. the randomness of {a^} and = means equality in 
the exponential order sense (ajv = bjy means that -^ln|^- — > as N — > oo). Thus, 
indeed there is a phase transition at R = h(p): For R < h(p), there are exponentially 
many source vectors that are mapped to the same value of E(er) on the average, but 
for R > h(p) the expected number of additional source vectors (other than <x) vanishes. 
Since Q(E(&), M(&)) is an integer-valued random variable, this also means (by the 
Chebychev inequality) that Pi{Q(E(er), M(<x)) > 1} also vanishes for R > h(p). 

While the final conclusions of our analysis are essentially the same as in [T2] (for 
R < h(p)), the message in this paper is three-fold: The first message is that it is not 
necessary to resort to the saddle point method in this case and it is possible to make the 
analysis rigorous as we show. The second message is that our analysis extends easily to 
more general situations, like larger source alphabets, availability of side information at 
the decoder (a.k.a. Slepian- Wolf encoding P, Section 15.8], [14j), and so on. The third 
message is that this analysis method can be used also to handle non-trivial alternative 
coding schemes, like a scheme based on the unconstrained subset sum problem. It turns 
out that for such a scheme, the phase transition occurs at a critical value of R which 
is different from the entropy h(p), and we provide an explicit expression, which is not 
trivial. 

2. Constrained Subset— Sum Coding 

Consider first the problem of counting the number of solutions {cr} to the two 
simultaneous equations 

E{a) = E{&) (3) 

M{a) = M(<x) (4) 

Denoting the Kronecker Delta function by 5{-) and \/—l by i, we have 

n(E(&), M{&)) = Y, S(E(&) - E(<t))S(M(&) - M{tr)) 

(7 
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where the summation over n and fe in the second line is over {0, 1, . . . ,N+} x 
{0, 1, . . . , jV_} \ {0,0} and where A" is the number of vectors ex = (ai,...,a n ) G 
{1,2, . . . ,L} n with S" =1 aj = s. The last line of eq. (JTj) has a simple interpretation: 
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Let M + = {i : a { = + 1}, A/1 = {i : a t = -1}, A/"+ = {i : a, = +1}, and 
A/1 = {i '■ cr.i = —1}. Obviously, E{&) = E(&) if and only if 

Yl a i = £ ^ ( 8 ) 

i 6A/+ nXL t e A/"- rW+ 

Also, for every cr with the same composition as <x, |A/+ fl A/l| = |A/1 PI A/+|. Let then 
n = | A/+ fl A/1 1 = | A/1 fl A/+ 1 . Every cr with the same composition as a corresponds 
to a choice of particular subsets (A/+ fl A/1 and A/1 fl A/+, both of size n) of A/1 and 
A/+, respectively For a given n, the number of combinations of these subsets is the 
product of the binomial coefficients in the last line of (JTj). For each such combination, 
the probability of the event (jHJ) is X} S (A™/L n ) 2 . The last line of eq. (J7|) exhausts the 
product of this probability by the number of combinations for all possible values of n. 

So far our analysis has been exact. We now need an evaluation of the exponential 
order of A™, where s scales like nL, i.e., s = (nL, ( e (0,1), and we expect 
the behavior to be symmetric in ( about the point £ = 1/2. So it is enough to 
cover the range ( G (0,1/2). The event J27=x a i = s is obviously equivalent to the 
event J27=i x i = C n ? where X{ = otijL. The number of points x = (xi,...,x n ) € 
{1/L, 2/L, . . . , 1 — 1/L, 1}" with Y%=i ^i — C n is exactly the same as number of points 
(xi, . . . , x„_i) G {1/L, 2/L, . . . , 1 - 1/L, l}"" 1 which satisfy n( - 1 < E?=i *i < n(, 
which is with excellent approximation given by L n_1 times the volume of the region 
within the unit cube [0, l]™^ 1 of continuous valued (n — l)-vectors (yx, . . . ,y n -i) that 
satisfy n( — 1 < Y17=i V* — n C ( see Appendix A for more details). This volume in turn 
has an exact formula (see, e.g., [3, Theorems 1,4]), which is given in the form of a sum 
of expressions with alternating signs, but it is not trivial to assess the exponential order 
of this formula in a compact manner. 

Alternatively, we may think of the volume of the set {n( — 1 < J2?=i Vi < n (} a s 
the probability of the event {n( — 1 < ^7=1 ^ — n C}) where {Yi} are i.i.d. random 
variables all uniformly distributed over [0, 1]. Now, denoting the unit step function by 
u(x) and using the fact that it is the inverse Laplace transform of the function 1/s, i.e., 
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This integral is easily evaluated using the saddle point method. In Appendix B, we 
show that the highest modulus of the integrand along the integration path, which is the 
vertical straight line Re(s) = c, is uniquely obtained at s = c, which yields 

Pt ln( - 1 <J2Yi < n(\ = e - n ^° (10) 

where 

$(C)=max[lnt-ln(l-e- 4 )-a], (e(0,l/2) (11) 

and we extend the definition of $(•) to the interval (0, 1) to be symmetric around 
( = 1/2, namely, $(1/2 — C) — ^(0- Note that maximizing t is the saddle point of 
the integral OH]), namely, it is the point at which the derivative of the expression in the 
square brackets vanishes. Also, the axis jH Section 5.4, p. 84] of this saddle point is in 
the vertical direction, which is the natural direction of integration path anyway. Thus, 
we now have 

K n s ^L n - l -e- n ^ ] ; s = (nL, (e(0,l). (12) 

On substituting this into the inner summation of ([7]), we get 
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Thus, for L = 2 NR , we have 

(£l(E(&), M(&))) - 1 = 2 N W>- B l, (15) 

which means that there is a phase transition at the critical rate of R c = h(p), 
as mentioned earlier. For R > h(p), the expression (fl(E(&),M(&))) — 1 decays 
exponentially, and hence so does Pr{Q(E(&),M(fr)) > 1}, which means that the 
decoding is unambiguous and correct with high probability On the other hand, for 
R < h(p) the probability for the existence of many additional solutions {cr} must 
be very high: Since the number of typical source sequences (i.e., sequences with 
M(tr) = M(<x) = N(p — q)) is exponentially 2 Nh ^ and the number of distinct values 
of E{&) cannot exceed N ■ L = N ■ 2 NR , the fraction of sequences {cr} (with the given 
composition) that are unique solutions to the equation E(er) = E(&) cannot exceed 

N2 NR/ 2 Nh(p) = 2 -N[h(p)-R}^ and SQ5 p T {n(E(fr),M(fr)) > 1} > 1 - 2~ N ^)- R y 

This result is actually quite expected. The critical rate R c cannot be strictly 
larger than h(p) because, as said, the total number of sequences with composition 
(p, q) is exponentially 2 Nh ( p ' > : Had R c been larger than h(p) we would have obtained 
that (Q(E)\ grows with an exponential rate which is faster than h(p) (at least when 
L is subexponential), which is impossible. On the other hand, R c cannot be strictly 
smaller than h(p), because then it would mean that Sasmoto's coding scheme achieves 
a compression ratio that is better than the entropy. Thus, R c must be equal to h(p). 

The above derivation extends straightforwardly in several directions (one at a time 
or simultaneously): 

1. Lossless source coding in the presence of correlated side information at the decoder. 
Consider the case where the decoder has access to a side information sequence r = 
(ri, . . . , T/v), which is correlated to the source sequence according to a given joint 
distribution P(a,r), and the two sequences are i.i.d. over time, i.e., 

N 
P(<T,T) = l[P(cJ l ,T i ). (16) 

i=l 

It is well known (see, e.g., [3, Section 15.8]) that in this case, the best achievable com- 
pression ratio is given by the conditional entropy of the source given the side information, 
even if the encoder does not have access to the side information (Slepian-Wolf coding 
[14]). Here we propose an alternative way to achieve this optimum compression ratio 
based on subset sum encoding: The encoder works essentially in the same manner as 
before. The decoder seeks solutions to the equation X^ a i°"i = E{&) only within the 
set of er-vectors whose joint empirical distribution together with the side information 
sequence is close to the joint distribution P{a,r) (within some small tolerance). In 
this case, an analysis similar to the above, reveals that the critical rate is given by the 
conditional entropy of the source given the side information. 
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2. Multiple Subset Sums. Instead of one set of random variables a±, . . . , a^, randomly 
drawn in {1,2,..., L}, consider an array of random variables {af}, i = 1,2, ... ,N, 
k = 1,2,..., m, all statistically independent, where each af is uniformly distributed 
in {1, 2, . . . , Lk}, where L^ = 2 NRk , Rt > 0. The encoder encodes er into 
(M(cr), Ei(cr), . . . , E m (er)), where each Ek(a) = X^ afcj is represented by log(NLk) = 
log iV + NRk bits. The decoder reconstructs a as the first vector whose encoding agrees 
with the given compressed input (M(cr), E\{&), . . . , E m (<r)). It is easy to show that the 
decoding is unambiguous with high probability iff J2k R-k > H. Thus, here the phase 
transition occurs at the whole hyperplane Y^k R-k — H. 

3. General finite alphabets. Another natural direction of extending the above result is 
from the case of a binary source alphabet to a general finite alphabet which, without 
loss of generality, will be assumed to be S = {1, 2, . . . , K }. A simple strategy is to 
decompose the problem into K — 1 binary encoding problems, and in each one of them 
we can rely directly on the binary code construction. Let the input source string a 
have N s = Np s occurrences of a t = s E E, s = 1, 2, . . . , K (of course, Y,f=i N s = N). 
Now, for each s = 1,2, ... ,K — I, let us select independently at random a{, a s 2 , . . . , a s N , 
each one drawn under the uniform distribution over the integers {1,2, . . . ,L S }, where 
L s = 2 NRs and R s will be specified shortly. The resulting random selections are 
revealed to the encoder and the decoder. The encoder works as follows: Given the 
source vector a, we first encode the numbers N\, N2, . . . ,Nk-i as overhead (just like 
the transmission of M(a) in the binary case). Next, for each s = 1, ... ,K — 1, we 
calculate E s (cr) = J2 i: ai=s af — Y,i : ai>s a i an d transmit its value using log(iV • L s ) bits. 
The role of each E s (cr) is to represent the information pertaining to all locations where 
Oi = s. Based on the results of the binary alphabet case, in order to decode Ei(cr) 
unambiguously, R\ should be at least as large as h{pi) (think of encoding the binary 
sequences {I (at = 1)}, where X(-) is the indicator function). For the next stage, the 
task is to fill in N2 out of the remaining (N — N±) locations by s = 2, and so we have 
reduced the problem to that of encoding the binary sequence {X(<Tj = 2)} i: a .^i of length 
(N — Ni). By the same reasoning then, to decode ^(c) reliably, R2 should be at least as 
large as (1 —pi)h(p 2 /(l —pi))- This procedure continues until s = K — 1, where reliable 
decoding of -Ex-i(s) requires R K ~i > (1 -p\ -■■■ -p K -2)h(p K -i/(l -p\- ... -p K --i))- 
The overall coding rate (neglecting the overhead) is then 

h( Pl ) + (1- Pl )h ( T ^~) + . . . + (1 -Pl - ■ ■ .- P K-2)h ( PK - i ] 



1-PlJ \ 1_ Pl _ ••• -VK-2) 

which is easily shown (using the chain rule of the entropy) to be identical to the entropy 
of the source 

K 

H = -^Ps^ogp s . 

s=l 
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Returning to the binary case, suppose next that we wish to make the mapping from 
a to E{&) essentially one-to-one over the entire source vector space { — 1,+1} N , and 
then there would be no need to specify the composition of a to the decoder. This 
corresponds to the unconstrained subset sum problem. How large should R be now? 
Here, the analysis is similar but somewhat more involved. The point in presenting the 
analysis for this case is not quite motivated by the usefulness of the data compression 
scheme itself, but more about demonstrating the applicability of the analysis method. 
This time, the derivation is as follows: 
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Taking now the expectation w.r.t. the randomness of {a?}, we readily obtain 
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Assuming that the binary vector (<7i, . . . , &n) is governed by a binary memoryless source 
with p = Pr{<Tj = +1} = 1 — Pr{<Tj = — 1} = 1 — q, we next take an ensemble average 
w.r.t. the randomness of {&{}, and obtain 
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Now, similarly as before 
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where 



^(/?) = niin 

0<x<min{/3,l-/3} 



P*l?s)+(i-P)* ' 



where /3 = r/fc. Denoting a = k/N and substituting this into eq. ffT9|) , we obtain 

(n(JB(*))> - 1 

L k\k )h\r) Pq 

= — -exp jiVmax[/i(a) + a max(/i(/3) + f3lnp + (1 - (3) In q - if) (f3))} 

= — ■ exp < iVmax[/i(a) — amin(D(/3||p) + i/)(/3))] 
= — ■ exp < iVmax[/i(a) — a£,(p)} 

_ 1 . 2 A f log 2 [l+e-«( p »] 5 / 2 1) 

where we have defined 

£(/%) = /31n^ + (1-/3) In ^ (22) 



and 



e(p)=nun[£>(/3|b)+V(/3)]. (23) 
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Again, if L = 2 NR , then the phase transition is now at R = R c , where 

i? c = log 2 [l + e-^]. (24) 

Note the special care should be exercised in the extreme cases where p = and 
p = 1. It is easy to see that in these cases D((3\\p) = oo for all (3, except (3 = p but when 
(3 = and (3 = 1, ip{(3) = oo, thus the sum D((3\\p) +ip{(3) is infinite for every (3 G [0, 1]. 
The choice a = is actually not allowed in the out-most maximization over a since the 
sum over k begins with k = 1. Thus, for p = and p = 1, we have R c = — oo, which 
means that (£l(E)\ = 1, as expected. 

Obviously, R c cannot be smaller than the entropy of the sourceljj] In general, R c 
is larger than the entropy, but the coding rate of the corresponding data compression 
scheme need not be as large as R c . By applying variable-rate entropy coding to E(er), 
taking advantage of the non-uniform distribution of this random variable, one can 
compress it at a rate very close to the entropy of the source. While in this case, R c 
no longer has the meaning of coding rate, it does another meaning, which is related to 
the storage requirement for saving the numbers {<2j}. Note that for R = 1, it is easy 
to specify a particular set of integers {<2j} that yields a one-to-one mapping from a to 
E(cr): By setting aj = 2 J_1 , E(er) becomes the standard binary representation of cr (up 
to a fixed shift). The above result tells us that we can manage with less storage since R c 
is in general less than 1, except the case p = 1/2, where R c = 1 (see also [9, Proposition 
7.6 and Exercise 7.8]). 
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Appendix 

A. Estimating A" 

Given a lattice and given a certain region in space, the number of lattice sites in that 
region is approximately given by the volume of the region divided by the volume of 
a single Voronoi cell pertaining to that lattice. This approximation improves as the 
ratio between these two volumes becomes very large. More precisely, there is a slight 
correction due to the fact that some of these Voronoi cells may not be entirely included 
in the region in question. To obtain upper and lower bounds on the number of lattice 
points, one may slightly expand (for an upper bound) or shrink (for a lower bound) the 
given region by an amount that guarantees full inclusion (for an upper lower bound) or 

Had it been smaller, we could have compressed at a rate below the entropy using Sasamoto's coding 
scheme, which is a contradiction [3]. 
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full exclusion (for a lower bound) of the partially included Voronoi cells that are near 
the boundaries. In our case, we have a cubic lattice in (n — 1) dimensions with spacings 
of 1/L in each dimension, so the volume of a Voronoi cell is 1/L n ~ 1 . By replacing nC, 
with n( ± n/L, one can obtain the aforementioned upper and lower bounds, but the 
corrections of ±n/L to n( and to n( — 1 have negligible effects in the exponential scale 
since L grows exponentially with n and hence n/L vanishes in the limit. 

B. Saddle Point of the Integral (TJ|) 

The modulus of the integrand depends solely on the real part of the exponent of the 
integrand, namely, on Re{sC + ln(l — e~ s ) — Ins}. Now, consider an arbitrary complex 
number s = r + iu>. Then obviously, 

Re{< + i n (i _ c -.) _ i n s} = r( + I i n ^ + e~ 2r - 2 e ~ r cos uj \ 

2 \ r z + u z J 

Thus, we have to show that 

1 + e~ 2r — 2e~ r cosu; 

r 2 + u 2 

is maximized at u = 0, and only at u = 0, for all r > 0. This would be equivalent to 

the assertion that 

(1 - e~ r f ^ 1 + e~ 2r - 2e~ r cosw . oe . 

2 ~, — 2 ' ( 26 ) 

r r + ui 

with equality if and only if uj = 0, since the left-hand side is obtained from the right- 
hand side by setting u = 0. 

In order to prove eq. fl26|) . we begin from the obvious inequality 6 > sin(#), which 
holds for all 9 > 0. Integrating both sides of this inequality from to w (w > 0), we 
obtain u 2 /2 > 1 — cos(w) with equality if and only if uj = 0, or equivalently, 

; ^ ( s > 2, (27) 

1 — cos(cj) 

which now holds for all u since the left-hand side is an even function. Similarly, 
beginning from the inequality 1 < cosh(r) and integrating both sides twice, we get 
r 2 /2 < cosh(r) — 1, or equivalently 

ty'-' 

(28) 



cosh(r) - 


- 1 " 


- ^i 


which when combined with 


flZZD yields 


cosh(r) - 


< 
- 1 ~ 


u 2 
1 — cos(o;) 


or equivalently, 






2r 2 e" r 


<? 


u 2 



(29) 



(l- e -r)2 -' 1_C0S(W)' (30) 

which is 

2r 2 e~ r - 2r 2 e~ r cos(w) < u 2 (l - e- r ) 2 . (31) 
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Adding r 2 (l — e~ r ) 2 to both sides of this inequality and rearranging terms, one obtains 
r 2 [l + e~ 2r - 2e~ r cos(w)] < (r 2 + w 2 )(l - e~ r )\ (32) 

which is equivalent to eq. ( )26i) . 
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